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(N| ; 1. INTRODUCTION 

A common pseudorandom number generator is the power generator: x i— > x e (mod n). 
Here, I, n are fixed integers at least 2, and one constructs a pseudorandom sequence by 
starting at some residue mod n and iterating this £th power map. (Because it is the easiest 
to compute, one often takes 1 = 7.) this case is known as the BBS generator, for Blum, 
Blum, and Shub.) To be a good generator, the period should be large. Of course, the pe- 
riod depends somewhat on the number chosen for the initial value. However, a universal 
^ \ upper bound for this period is A(A(n)) where A is Carmichael's function. Here, A(m) is 

defined as the order of the largest cyclic subgroup of the multiplicative group (Z/raZ) x . 
It may be computed via the identity A(lcm{a, b}) = lcm{ A(a),A(b)} and its values at 
prime powers: with tp being Euler's function, A(p a ) = (p(p a ) = (p — l)p fl_1 for every odd 
prime power p a and for 2 and 4, and A{2 a ) = ($){2 a )/2 = 2 a ~ 2 for a > 3. 

Statistical properties of A(n) were studied by Erdos, Schmutz, and the second author in 
[7\, and in particular, they showed that A(n) = nj exp((l + o(l)) log log n log log log n) 
as n — > oo through a certain set of integers of asymptotic density 1. This does not quite 
pinpoint the normal order of A{n) (even the sharper version of this theorem from [7\ falls 
short in this regard), but it is certainly a step in this direction, and does give the normal 
order of the function log(n//\(n)). 

In this paper we prove a result of similar quality for the function A(A(n)), which we 
have seen arises in connection with the period of the power generator. We obtain the 
same expression as with A{n), except that the log log n is squared. That is, A(A(n)) = 
n/exp((l + o(l)) (log log n) 2 log log log n) almost always. 

We are able to use this result to say something nontrivial about the number of cycles for 
the power generator. This problem has been considered in several papers, including |3|, 
[4J, and |15J. We show that for almost all integers n, the number of cycles for the £th power 
map modulo n is at least exp((l + o(l)) (log log n) 2 log log log n), and we conjecture that 
this lower bound is actually the truth. Under the assumption of the Generalized Riemann 
Hypothesis (GRH), and using a new result of Kurlberg and the second author [12J, we 
prove our conjecture. (By the GRH, we mean the Riemann Hypothesis for Kummerian 
fields as used by Hooley in his celebrated conditional proof of the Artin conjecture.) 

For an arithmetic function f(n) whose values are in the natural numbers, let j\{n) de- 
note the fcth iterate of / evaluated at n. One might ask about the normal behavior of A^{n) 
for k > 3. Here we make a conjecture for each fixed k. We also briefly consider the func- 
tion L(n) defined as the least k such that A^{n) = 1. A similar undertaking was made 
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by Erdos, Granville, Spiro, and the second author in @ for the function F(n) defined as 
the least k with 4>k( n ) = 1- Though A is very similar to (p, the behavior of L(n) and F(n) 
seem markedly different. We know that F(n) is always of order of magnitude log n, and 
it is shown in |5|, assuming the Elliott-Halberstam conjecture on the average distribution 
of primes in arithmetic progressions with large moduli, that in fact F(n) ~ alogn on a 
set of asymptotic density 1 for a particular positive constant a. We know far less about 
L(n), not even its typical order of magnitude. We raise the possibility that it is normally 
of order log log n and show that it is bounded by this order infinitely often. 
A more formal statement of our results follows. 

Theorem 1. The normal order of log (n/A(A(n))) is (log log n) 2 log log log n. That is, 
A(A(n)) = nexp ( — (1 + o(l))(loglogn) 2 logloglogn J 

as n — > oo through a set of integers of asymptotic density 1. 

We actually prove the slightly stronger result: given any function \p(n) going to infinity 
arbitrarily slowly, we have 

A(A(n)) = nexp(— (loglogn) 2 (logloglogn + 0(i/>(n)))) 

for almost all n. 

Given integers £,n > 2, let C(£,n) denote the number of cycles when iterating the 
modular power map x i— > x £ (mod n). 

Theorem 2. Given any fixed integer £ > 2, there is a set of integers of asymptotic density 1 such 
that as n — > oo through this set, 

C(£,n) > exp ((l + o(l)) (log log n) 2 log log log n). (1) 

Further, ife(n) tends to arbitrarily slowly, we have C(£, n) < n l l 2 ~ e ^ for almost all n. More- 
over, for a positive proportion of integers n we have C(£,n) < n A09 . Finally, if the Generalized 
Riemann Hypothesis (GRH) is true, we have equality in ((TJ on a set of integers n of asymptotic 
density 1. 

Conjecture 3. The normal order of log (n/A^(n)) is (l/(k — l)!)(loglogn) fc logloglogn. 
That is, for each fixed integer k > 1, 

A k (n) =nexp(- ( r k _ 1 y +°( 1 )j ( 1 og 1 og") fc (logloglogn) J 

for almost all n. 

Define L(n) to be the number of iterations of A required to take n to 1, that is, L(n) 
equals the smallest nonnegative integer k such that A^{n) = 1. 

Theorem 4. There are infinitely many integers n such that L(n) < (1/ log 2 + o(l)) log log n. 
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2. Notation, strategy, and preliminaries 

The proof of Theorem [TJ our principal result, proceeds by comparing the prime divi- 
sors of A(A(n)) with those of 4>(<p(n)). The primes dividing 4>(m) and A(m) are always 
the same. However, this is not always true for 4>(<p(m)) and A(A(m)). The prime 2 clearly 
causes problems; for example, we have 4>(<p(8)) = 2butA(/\(8)) = 1. However this prob- 
lem also arises from the interaction between different primes, for example, 4>(<p(9l)) = 24 
butA(A(91)) =2. 

We shall use the following notation throughout the paper. The letters p, q, r will always 
denote primes. Let Vq(n) denote the exponent on q in the prime factorization of n, so that 

„ = Y[q Vq{n) 

<? 

for every positive integer n. We \efP n = {p:p = l (mod n)}. We let x > e e be a real 
number and y = y(x) = log log x. By ip(x) we denote a function tending to infinity but 
more slowly than log log log x = logy. In Sections 2-5, the phrase "for almost all n" 
always means "for all but 0(x/ip(x)) integers n < x" . 
First we argue that the "large" prime divisors typically do not contribute significantly: 

Proposition 5. For almost all n < x, the prime divisors of <p{(p{n)) and A(A(n)) that exceed y 2 
are identical. 

Proposition 6. For almost all n < x, 



£ v q (4>(4>(n)))logq « y 2 iP(x). (2) 

q>y 2 
p,(0(«(n)))>2 

Next we argue that the contribution of "small" primes to A(A(n)) is typically small: 
Proposition 7. For almost all n < x,we have 

E MKKn)))logq < y 2 ip(x). 

Finally, we develop an understanding of the typical contribution of small primes to 
4>(4>(n)) by comparing it to the additive function h\n) defined by 

h (») =111 v q (r-l)logq. (3) 

p\n r\p—l q<y 2 

Proposition 8. For almost all n < x, 

£ v q (4>(4>(n)))logq = h(n)+0(y log y-tp(x)). 

q<y 2 

Proposition 9. For almost all n < x,zve have h(n) = y 2 log y + 0(y 2 ). 
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Proof of Theorem^ Let x be a sufficiently large real number. For any positive integer n < x 
we may write 

log n = log ^ + log ^ n) +log ^ (n)) 
i0g A(A(n)) i0g 0(n) + i0g 0(*(n)) + ° g A(A(n)) 

Recall that n/(p(n) <C log log n, and so the first two terms are both 0(log log log x). Thus, 
it suffices to show that 

log ^A^ = ( l °S^gx) 2 (logloglogx + 0^(x))) = y 2 log y + 0(y 2 ^(x)) (4) 

for almost all n < x. We write 

= £ v q (4>(4>(n)))\ogq- £ v q (A(A(n))) log q (5) 

<?<y 2 <?<y 2 

+ I (t7,(0(*(n)))-t7,(A(A(n))))log^. 

<?>y 2 

Since A(A(n)) always divides 4>(4>(n)), the coefficients of logq in this last sum are all 
nonnegative. On the other hand, Proposition |5]te lis us that for almost all n < x, whenever 
Vq((p(4>(n))) > we have v q (A(A(n))) > as well. Therefore the primes q for which 
Vq(tp(4>(n))) < 1 do not contribute to this last sum at all, that is, 



< £ (v q {<t>{<t>{n))) -v q {A{A{n))))logq 

(v q (cf>(<p(n))) -v q (A(A(n)))) log q 



q>y 2 



q>y 2 



v q {4>(<p(n)))>2 

< £ v q (4)(4){n))) log q < y 2 ip(x) 

q>y 2 
v q {4>{4>{n)))>2 

for almost all n < x by Propositions |5] and |6j Moreover, Proposition |7| tells us that the 
second sum on the right-hand side of equation © is 0(y 2 ip(x)) for almost all n < x. 
Therefore equation ||5jl becomes 

for almost all n < x. By Proposition |SJ the sum on the right-hand side can be replaced by 
h(n) for almost all n < x, the error 0(y log y • ip(x)) in that proposition being absorbed 
into the existing error 0{y 2 \l>{x)). Finally, Proposition tells us that h(n) = y 2 \ogy + 
0(y 2 ) for almost all n < x. We conclude that equation (|4|) is satisfied for almost all n < x, 
which establishes the theorem. □ 
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Given integers a and n, recall that n(t; n, a) denotes the number of primes up to t that 
are congruent to a (mod n). The Brun-Titchmarsh inequality (see 11101 Theorem 3.7]) states 
that 

7t(t;n,a) < -rr-v-, — rrr^ ( 6 ) 

for all t > n. We use repeatedly a weak form of this inequality, valid for all t > e e , 

I i « W, (7) 

f<t V <t>(n) 

peV„ 

which follows from the estimate l|6jl with a = 1 by partial summation. When n/(p(n) is 
bounded, this estimate simplifies to 

£ I « ^i^s!. (8) 

For example, we shall employ this last estimate when n is a prime or a prime power and 
when n is the product of two primes or prime powers; in these cases we have n/<p(n) < 3. 
We also quote the fact (see Norton |13| or the paper fl4fl of the second author) that 

y 1 loglogt /logHX 

p ^p *(n) +U U(n)^ W 

This readily implies that 

y _J_ loglogt /lognx 

J^p-1 <J>(n) +U U(n)^ UUJ 

as well, since (noting that the smallest possible term in the sum is p = n + 1) the difference 
equals 

y 1 < y - — 

We occasionally use the Chebyshev upper bound 

£logp < £A(«) « z, (11) 

p<z n<z 

where A(n) is the von Mangoldt function, as well as the weaker versions 

£^«logz, £^«log 2 z (12) 

p<z r p<z r 

and the tail estimates 

— V- < -> E -9 < -i — > ( 13 ) 
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each of which can be derived from the estimate (JTTJl by partial summation. We shall also 
need at one point a weak form of the asymptotic formula of Mertens, 

£^p = logz + 0(l). (14) 

p<z r 

For any polynomial P(x), we also note the series estimate 

^ m a 

uniformly for m > 2, valid since the series E^°=o P( a ) za converges uniformly for \z\ < \. 
The estimates 

E m <<p i, z m « P i, (15) 

aen m m aeN m z 

m">z 

valid uniformly for any integer m > 2, follow easily by factoring out the first denominator 
occurring in each sum. 

3. Large primes dividing 4>(4>(n)) and A(A(n)) 

Proof of Proposition^ If ^ is any prime, then q divides (p((p(n)) if and only if at least one 
of the following criteria holds: 

• q 3 \ n, 

• there exists p G V 2 with p | n, 



• there exists p £ Vq with p 2 | n, 



'/- 

• there exist r G Vq and p eP r with p | n, 

• ^ 2 J n and there exists p £ Vq with p | n, 



• there exist distinct p\, p?_ G 7^ with pip2 | ft. 

In the first four of these six cases, it is easily checked that q \ A(A(n)) as well. (This is not 
quite true for q = 2, but in this proof we shall only consider primes q > y 2 .) Therefore we 
can estimate the number of integers n < x for which q divides <p((p(n)) but not A(A(n)) 
as follows: 

E i<IE'+I I E i< Ei+ E E -^. 

"<* peV q n<x pieV<,p 2 eV q n<x v eV q 1 V p r <EV q p 2 eV q r 1 ^ 2 

q\<p(<p(n)) q 2 p\n p + p pip 2 \n 

q\A{Kn)) 

Using three applications of the Brun-Titchmarsh inequality ©, we conclude that for any 
odd prime q, 

y 1 «^ + 2^«V 
„s* 1 3 1 1 1 2 

q\4>((p(n)) 
q\KMn)) 



Consequently, by the tail estimate dl3t and the condition ip(x) = o(log y), 



1 xy 2 x x 



I I l«^L-2« -Tf£^2 < T^T « 



tfA(A(n)) 

Therefore for almost all n < x, every prime q > y 2 dividing 4>(tp(n)) also divides A(A(n)), 
as asserted. D 

Lemma 10. Given a real number x > 3 and a -prime q > y 2 , define Sq = Sq(x) to be the set of 
all integers n < xfor which at least one of the following criteria holds: 

• q 2 \ n, 

• there exists p G V q i with p \ n, 

• there exist r G V q i and p £ V r with p \ n, 

• there exist distinct r\,ri,ro, G Vq and p G V nr2 r 3 with p \ n, 

• there exist distinct rxJij^r^ G Vq, p\ G T-V^/ fln <^ P2 G 7^4 zw'tfi P1P2 | «• 
Then #ze cardinality of Sq is 0(xy 2 /q 2 ). 

Proof. The number of integers up to x for which any particular one of the five criteria 
holds is easily shown to be 0(xy 2 /q 2 ). For the sake of conciseness, we show the details of 
this calculation only for the last criterion, which is the most complicated. The number of 
integers n up to x for which there exist distinct r\, rj_, r^, r± G Vq, p\ G V Tl r 2 , an d pi G Vr 3 r A 
with p\pi\n is at most 

E E E 1 < E E j4- 

P2eVr 3 r 4 PmW PieVr 3ri 

Using six applications of the Brun-Titchmarsh estimate ©, we have 

y y x <*- y x y ^ x v < x v 

P2e^ 3 r 4 

the last inequality being valid due to the hypothesis q > y 2 . □ 

Proof of Proposition® Define S = S(x) to be the union of Sq over all primes q > y 2 , where 
Sq is defined as in the statement of Lemma [Toj Using #A to denote the cardinality of a set 
A, Lemma llOl implies that 

#S< V #Sq « V ^ « , X f 7 « T^T 

by the tail estimate (O and the condition ip(x) = o(logy). Therefore to prove that the 
estimate © holds for almost all integers n < x, it suffices to prove that it holds for almost 
all integers n < x that are not in the set S. This in turn is implied by the upper bound 

£ £ Vq{<p{<p{n)))\o%q < xy 2 , (16) 

n < x q>y 2 

n $ S v q (4>(4>(n)))>2 



which we proceed now to establish. 

Fix a prime q > y 2 and an integer a > 2 for the moment. In general, there are many 
ways in which q a could divide 4>(<p(n)), depending on the power to which q divides n 
itself, the power to which q divides numbers of the form p — 1 with p \ n, and so forth. 
However, for integers n ^ S, most of these various possibilities are ruled out by one of 
the five criteria defining the sets Sq. In fact, for n ^ S, there are only two ways for q a to 
divide 4>(<p(n)): 

• there are distinct r\,...,r a C Vq and distinct p\ G V ri , ■■■,pa £ Vy a with p\...p a \n, 

• there are distinct r\,...,r a C Vq, distinct p\ G V n , ..., p a -i £ ^ , r fl _ 2 ' an d V ^ 
^r fl _ar„Withpi...p fl |n. 

(We refer to the former case as the "supersquarefree" case.) 

Still considering q and a fixed, the number of integers n up to x satisfying each of these 
two conditions is at most 

i-i 7^ 2-i L — L* 7[ L ~77 77~ 

n,...,r a eV q "• Vl eV n n<x n,...,r a eV q u ' pieV n Pi-~P« 

and 



Fflfc' ra Pa^l r, 



i < y y 

- Li (n-l\\ Li 



r 1 ,..7T6^ 2! («-2)! n % x n % ri ^ eVq {a-2)\ ^ pi... Pa -2P 

Pa - 2 eK- 2 Pl - Pa - 2Pln P^eK- 2 

respectively, the factors l/a\ and 1/2! (a — 2)! coming from the various possible permu- 
tations of the primes r;. Letting c > 1 be the constant implied in the Brun-Titchmarsh 
inequality © as applied to moduli n that are divisible by at most two distinct primes, we 
see that 

y I y x < y 1 x ^ a < x ( c y) 2a 

VaeVr a 



y 1 y x < y 1 xjcyy- 1 < x{cy) la - 1 

..keV q ( a - 2 V- v h x Pi-Pa-2P ~ n/ ± eVq (a-2)l r x ...r a -(a-2)lf 



and 

y 1 y X ^ ^ 1 xjcyy- 1 ^ X{cy) la - 1 

n,...,r a eV^ a ~ 2 y- Pl eV ri "' 

Pa-2ePr a _ 2 

Therefore the number of integers n < x such that n ^ S and q a \ <p(4>(n)) is 

x(cy) 2a x{cy) 2a ~ x c 2a xy* 

~ a\q a + (a-2)\q a {a-2)\q 2 ' K ' 

where we have used the assumption q > y 2 . 
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We now establish the estimate dl6b . Note that 

£ £ v q (4>(<f>(n)))lo S q < 2£ £ M<K<p(«))) - 1) logg 

"<* g>y 2 n<x q>y 2 

n $ s v q (4>(4>(n)))>2 n £ S P,(0(0(n)))>2 



<7>y 2 fl>2 «<x 



<? fl lW(«)) 



Therefore, using the bound (fT7|) for each pair ^ and a, 



2a 4 

£ £ *,(*(*(»))) log* < 2 £ log* £ J** 

«<* ^y 2 , >y 2 fl >2 V" Z M 

n * S iV(*(*(n)))>2 

= 2cW £ ^|i « ^ = xy 2 

<?>y 2 ^ ^ 

by the tail estimate (Tl3b . This establishes the estimate dl6b and hence the proposition. □ 

4. Small primes and the reduction to h(n) 

Lemma 11. For any prime power q a , the number of positive integers n < xfor which q a divides 
A(A(n)) isO(xy 2 /q a ). 

Proof. The prime power q a divides A(A(n)) only if at least one of the following criteria 
holds: 

• q a+1 | n, 

• there exists p G Vq* with p 2 \ n, 

• there exists p G V qa +\ with p \ n, 

• there exist r G Vq« and p G V r with p \ n. 

Thus 

E i< E i+ E Ei+ E Ei+ E E Ei 

n<x n<x peV q a n<x p6P fl+ i«<i reP q a peV r n<x 

q"\A(A(n)) q a+2 \n ' p 2 \n q p\n ' p\n 

<^+ E i+ E x z+ E Ef- (is) 

p<^i p<x P— x 

In the first of these three sums, it is sufficient to notice that any p G Vq» must exceed q a , 
which leads to the estimate 

EX -r — i X X 

-9 < ) —9 < -7- 

r,2 -^ m 2 ^fl 

peP 9 « P «><?« m 9 
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To bound the second and third sums, we invoke the Brun-Titchmarsh estimate © a total 
of three times: 



x xy 



E - « 

p<x 



fl+1 



,2 



reP q a P eP r P reV f r <? 

V<x r <x 

Using these three estimates, dl8t gives 

2 2 

Ex x x 1/ x 1/ x iy 

1 <gr 1 1 — - — i — — <c — — 
;i<x q a + 2 f q a+l f q a ' 

q"\l{X{n)) 

which establishes the lemma. □ 

Proof of Proposition^ We have 
£ v q (A(A(n)))\ogc, = £ log^ £ 1 < £ log^ £ 1+ £ log^ £ 1. 

o<V 2 £?<V 2 fl GN o<v 2 «€N o<i/ 2 fl€N 

*~' <?"|A(A(n)) *~* cf<f '-* q a >y 2 

q*\A(A(n)) 

Since the first sum is simply 



£ log;? £ 1 = £ A(m) « y 2 

q<y 2 fl€N m<y 2 

<f<y 2 



(19) 



by the Chebyshev estimate dTTb , we have uniformly for n < x, 

£ v q (A(A{n)))logq « y 2 + £ log^ £ 1. 

q<y 2 q<y 2 aeN 

q a >y 2 

q"\A(A(n)) 

To show that this quantity is usually small, we sum this last double sum over n and apply 
Lemma ITT1 yielding 

E E i°g<? E i = E lo g<? E E i « E i°g<? E ^- 

"<*(7<V 2 «€N o<v 2 fl€N n<x q <u2 aeN " 

q a >y 2 q a >y 2 q a \MA(n)) ,_5 (? «>y2 

<f|A(A(«)) 

Using the geometric series sum dl5b and the Chebyshev estimate (HTJ, this becomes 

2 

E £ l°g<? £ 1 « £ log^-^r « xy 2 . 



M^X^y 2 fl6N ^y 2 T 

q a >y 2 
q"\A(A(n)) 
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Therefore if we sum both sides of dl9t over n, we obtain 

£ £ ^W( M ))) lo g<7 < *y 2 - 

«<*<7<1/ 2 

This implies that for almost all n < x, we have 

£ v q (A(A{n))) log q < y 2 ^(x), 

as desired. □ 

Proof of Proposition® Fix a prime q for the moment. For any positive integer m, the usual 
formula for 4>(m) readily implies 

v q (4)(m)) = max{0,v q (m)-l} + Y,v q (p-l), 

p\m 

which we use in the form 

E^(p-l) < v q {<j){m)) < £^(p-l)+^( m )- 

p\m p\m 

Using these inequalities twice, first with m = 4>(n) and then with m = n, we see that 
£ v q (p-l) < v q (4>(<p(n))) < £ v q {p-l)+v q {c$>{n)) 

p\4>{n) P\(p(n) 

< £ v q (p-l) + £v q (p-l)+v q (n). (20) 

p\cp(n) p\n 

Now a prime r divides <p(n) if and only if either r 2 | n or there exists a prime p | n such 
that r \ p — 1. Therefore 

I I ^(r-1) < £ v q (r-l) < £ £ r ¥ (r-l)+ £ v q (r-l), 

p\nr\p—l r\4>( n ) p\nr\p—\ r:r 2 \n 

the latter inequality accounting for the possibility that both criteria hold for some prime r. 
When we combine these inequalities with those in equation d20l and subtract the double 
sum over p and r throughout, we obtain 

< r ? (0(4>(n)))-£ £ v q (r-l) < £ *,(r-l) + J>,(p-l)+i>,(n) 

p\n r\p—\ r: r 2 \n p\n 

< 2£^(p-l)+^(n). 

p\n 

Now we multiply through by log q and sum over all primes q < y 2 to conclude that for 
any positive integer n, 

< £ v q (<p((p(n))) log q-h(n) < 2 £ Y t v q (p-l)logq+ £ ^(n)log^. 

<?<y 2 q<y 2 v\ n q<y 2 

It remains to show that the right-hand side of this last inequality is 0(y log y ■ ip(x)) for 
almost all n < x, which we accomplish by establishing the estimate 

£ £ £^(p-l)l°g<7 + £ £ v q (n)logq < xylogy. (21) 

n<x q <y2p\ n n<x q <y2 
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We may rewrite the first term on the left-hand side as 

n<xq<y 2 p\n n<x q<y2 p\ n aeN 

q a \p—l 

= I log? I E £i< Eiogol I *-■ 

q<y2 aeN peVqa n<x q < y 2 aeN peP q a r 

p\n 

Using the Brun-Titchmarsh inequality © and the geometric series estimate dl5b , we ob- 
tain 

£ £ EVp- 1 ) 1 ^ < * £ log<?£-^ < *y £ -^ < ^ylogy 2 . 

n<xq<y2p\ n cj<y2 aeN" q<y2 " 

The second term on the left-hand side of (|2l|) is even simpler: we have 

£ £ V n ) lo g<? = £ lo g<? £ £ i < £ lo g<? £ 4. 

n<xq<y2 q<y2 aeN n<x q<y2 aeN " 

and using the geometric series bound ((l5)l and the weak Chebyshev estimate (fl2)l yields 

£ £ ^(n)logg < x £ -^ < ^logy 2 . 

"<*<7<i/ 2 q<y 2 " 

The last two estimates therefore establish (|2l|) and hence the proposition. D 

5. The normal order of h(n) 

Recall the definition ©: h(n) = Y.p\ n s Lr\p-\Lq<y2V q{r — 1) log^. We now calculate 
the normal order of the additive function h(n) via the Turan-Kubilius inequality (see 
llffl . Lemma 3.1). If we define 

M lW = £ k M, M2 (x) = £ *&£, 

p<x r p<x r 

then the Turan-Kubilius inequality asserts that 

£(^(n)-M 1 (x)) 2 < xM 2 (x). (22) 

Proposition 12. We have Mi (x) = y 2 log y + 0(y 2 ) for all x > e e< . 
Proposition 13. We have M.2(x) <C y 3 log y/or aZZ x > e e ' ' . 

Proof of Proposition^ Let N denote the number of n < x for which \h(n) — Mi(x)\ > y 2 . 
The contribution of such n to the sum in (J22)l is at least y^N. Thus, Proposition Il3limplies 
that N <C x(log y) 2 /y- Hence, Proposition lL2l implies that h(n) = y 2 log y + 0(y 2 ) for all 
n < x but for a set of size 0(x(log y) 2 )/y)- This proves Proposition |9j D 

To calculate M\{x) and M2(x) we shall first calculate L»<^(p) an d L»<t^(p) 2 an d 
then account for the weights 1/p using partial summation. We begin the evaluation of 
E»<( h(p) with a lemma. 
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Lemma 14. Let bbe a positive integer and t > e e a real number. 

(a) Ifb>t 1 /*then 

£ n{t;r,l) « — f-. 
reV b ° 

(b) ljb<t x l^then 

En -,\ bt 
7c(t;r,l) < TTTvn :• 
,en ^(fc) 2 logf 

r>*V3 

and 

7r(f;r,l) < ,.°. 6 x 
ren <K&)log* 

Remark. The exponents | and | are rather arbitrary and chosen only for simplicity; any 

-i 

two exponents < a < ft < j would do equally well. 

Proof. Notice that in all three sums, the only contributing terms are those with r > b and 
r < t. If b > t 1 / 4 , then the trivial bound n{t; r, 1) < t/r gives 

v /. „\ - v t ^ v-i t tlogt 

£ n{t;r,\) < £ - < £ - « -J*-, 

reP b reV b ' m=\ (mod b) m V 

t l / A <r<t t 1 / 4 <m<t 

proving part (a) of the lemma. 

We now assume b < t 1 / 4 . We have 

£ n(t; r, 1) = #{(m, r) : r = 1 (mod fc), r > r' 3 , mr + 1 < t, mr + 1 and r both prime} 

reV b 
r>*V3 

< £ #{r < ^ : r = 1 (mod fc), mr + 1 and r both prime} 

m<f 2 / 3 

< £ -^(mfc)0(fc)log 2 — 

m<t 2 ' 3 

by Brun's sieve method (see [10. Corollary 2.4.1]). We have -^ > f 1 / 12 and so log -^ » 
log £. We also have 4>(mb) > 4>(m)<p(b) and the standard estimate 

Therefore 

^ . . ^ &i btlogt 2 / 3 ^ bt 

in m ^4>{m)4>{bY\o Z 2 t <p(b) 2 Wt ~ ^r\o S t 

r>t V3 

13 



establishing the first estimate in part (b). Finally, by the Brun-Titchmarsh inequalities ® 
and ©, 

r <fL/3 r<f : /3 r<i : / 3 

Combining this estimate with the first half of part (b) and the standard estimate b/<p(b) <C 
log log b establishes the second half. □ 

Lemma 15. For all real numbers x > e e and t > e e , we have 

E, , s It log log Hog y ^/ Hog log £ Hoe y *i±. ?\ 

hip) = ^—2 g-^ + Of , 6 6 + - 6 u + f 3 / 4 logf -y 2 ). 
„^ r logf V loef log* & v ) 



p<t 



Remark. In particular, we have E p<x h(p) «i log log x log yj log x = xy log y/ log x. 
Proof. We may rewrite 

1%) = 111 **('-!) log* =1111 log* 

P<* P<f r|p-l(j<y 2 P<t r\p-\ q<y 2 aeN 

q a \r—l 

= £ log*L £ £1=1 log* £ £ 7r(*;r,l). (24) 

<7<i/ 2 aeNr:q a \r-l p<t q<yi aeNreV^a 

r\p—l 

The main contribution to this triple sum comes from the terms with q a < t 1 ' 4 and r < f 1 ' 3 . 
In fact, using Lemma IT4T a) we can bound the contribution from the terms with q a large 

by 

£ log* £ £ 7r(*;r,l) < £ log* £ £ 7r(*;r,l) « £ log* £ ^ 

o<y 2 aeN re7> q<if aeN reV q a q<y 2 aeN " 

' ' q a >t x l^ ' ' ' q a >t l l 4 ' ' ' q a >t l l* 

«^t^ l -^«t^lo Z t.y\ 

q<y2 l 

where the last two estimates are due to the geometric series bound (|T5j and the Chebyshev 
bound (TPT1) . Similarly, using the first half of Lemma IT4T b) we can bound the contribution 
from the terms with *" small and r large by 

£ log* £ £ n(t;r,l) « £ log* £ ' « * £ ^ « ^1, 

"? fl < il/4 r>fl/3 <?"< fl/4 

where again the last two estimates are due to the geometric series bound (IT5t and the 
weak Chebyshev bound (IL?! . In light of these two estimates, equation d24b becomes 

£%)= £log* £ £ n{t; r, 1) + O (^ log f.y 2 + ^). (25) 

p<f <7<y2 fl6 N r€Pq« i0 g r 



<f<&* T<t fl3 
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Define E(t;r,l) = 7t(t;r,l) -U(t)/(r-l). We have 
I>g<? £ £ n(t;r,l) = £ log^ £ £ (M- + E(* ;r ,i; 

q<y 2 aeN reV q a q<y 2 «6N reV q a 

= Eiog<? I I 7rr + °f E lo s<7 I I |E(*;^i)l). (26) 

q<y 2 fl6N re7> r i V q<y 2 aeN reV a a / 

Let fl(ra) denote the number of divisors of m that are primes or prime powers. Using the 
estimate Cl(m) <C logra, we quickly dispose of 

E log*? E I |E(*;r,l)| = logy £ |E(*;r,l)| £ £ 1 

0<y2 fl€ N re7> Kfl/3 a<y 2 fl€ N 

9"<* 1/4 r < t i/3 £ ? fl l r - 1 

< logy £ |E(f;r,l)|a(r-l) 

r<fV3 

« logylogf £ |E(*;r,l)| « ^ff 

r <fl/3 & 

by the Bombieri- Vinogradov theorem (we could equally well put any power of log t in the 
denominator of the final expression if we needed). Inserting this estimate into equation 
d26l l, we see that equation d25t becomes 

£ %) = li(0 £ log , £ £ -1- + O (f 3 / 4 log t • y 2 + !£m . (27) 

p<* q<y 2 aeNreV q a r i io g r 7 

r<tV3 



We have by equation (|T0 



v-> i v-< v-> 1 v-< i v-< /log log f 1 / 3 /logo" 

E^E E —[ = Ei»g?E( ^ ( ; fl) + °HF 



r<*V3 



Mio g io gt+ o, 1 ))i,o g ,E(I + o( ? L)) + o(E 1 o g ^E±) 

Mlo g lo gf+ O (1 ))l(^ + 0(^)) + 0( E ^), 

using the geometric series estimate <TT5b . Using the Mertens formula (JHJ) to evaluate the 
main term and the weak Chebyshev estimates (JT2J to bound the error terms, we see that 

i 
E lo g<?E E 73T = log log flog y 2 + 0(logy + log log f + log 2 y). 

<7<y2 fl6 N reV q a Y 

rKt 1 / 3 
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We conclude from equation G3 and the fact that li(t) = f/ log t + 0{t/ log 2 t) that 
£%) = li(t)(loglogtlogy 2 + 0(logy + loglogf + log 2 y)) 



+ 0^ 3 / 4 lo^-v 2 + ^ 

>g 



+ o(f 3 / 4 logf.y 2 + ^ 
V ° J logt 

2tlog log t logy / Hog log t Hog 2 y 3/4 2 

logi V logt logi 8 y 



>g' 

as asserted. □ 



Proof of PropositionuJ] In an explicit example of the technique of partial summation, we 
write 



p<* V v < e e V e e <p<x \ x J V l / 

1 /•* dt 

= o(i) + - £ %)+/ % £ h(p). 



:p<x Je * e e <p<t 

The quantity Y,p<th(p) has been evaluated asymptotically in Lemma Il5l and the quan- 
tity Le e <o<f h(p) differs by only 0(1). Therefore we may use Lemma Il5l and the remark 
following its statement to write 

W x 2/ lo gr 



Mi(x) = 0(l) + -0( 



logx 

/ " IM / ^1 11.1", 1UU / JUVi 1/ _ / I iOH. HIH. I I l(.)Vd~ 



r'ft/2tloglogtlogy /tloglogt tlog y_ m ; 

A* f 2 V log £ V log i log i & y 

oz-y^gy^! r 21oglog t 

V logX ) h)j Je? tlogt 

/ r* log log f , , 2 f* dt j f x dt \ 

+ 01 f 6 dt + log 2 y / + y 2 / -^ ) . 

V7^ tiogt 6 y y^ tiogt y A* (5/4 y 



Each of these integrals can be explicitly evaluated, resulting in the asymptotic formula 

Mi(x) = logy((loglogx) 2 -l) + o(^^ + (loglogx) 2 + log 2 yloglogx + y 2 ) 

= y 2 log y + 0(y 2 ), 
as claimed. D 

Now we turn our attention to M.2(x), beginning with some preliminary lemmas. 
Lemma 16. For all real numbers x > e e ' and t > e e , we have 

£ logftlog,, E E E KO^logt/logy+ '^'^.'^y . 

q,,q 2 <y 2 ai,a 2 €NreP a,r\V a 2 V< 1 ° 

' <\ '2 p=l(mDdr) 
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Proof. Since the exact form of V «i fl V « 2 depends on whether or not c\\ = qi, we split the 
expression in question into two separate sums: 

£ log qi log q 2 £ £ £ 1 (28) 

U-\,U2<V 2 ai,a 2 eNreV o,nP a 7 v<t 

' »l '2 p=l(modr) 

= £ lo g 2 <? £ £ 7t(t;r,l)+ £ log ^i log ^ 2 £ £ 7c(t;r,l] 

q< y 2 a lr a 2 eNreV meL x{ ai ,a 2 } qi,qi<y 2 a lf a 2 eNreV ai « 2 

Noting that there are exactly 2a — 1 ordered pairs («i, 02) f° r which max{fli, ^2} = a , we 
have 

£l°g 2 <? £ £ TrfrU) = £log 2 ^£(2a-l) £ n(t;r,l) 

q<yl fll,fl 2 eNr6^ max{fll , fl2} g<y2 fl€ N reV q a 

v-. , 2 v-i fltlogf v-> t 2 v-i fl* log log* 

« £ iog 2 ^ £ — ^+ £ iog 2 ^ £ f f 



by Lemma 1141 Since 

E log 2 ? E ^ « "ogHogy 2 E ^f « ' 3/4 log (-/logy 

o<y 2 aeN */ a<y2 l ' 

by the Chebyshev bound (|TTJ), and 

9 9 

E, 2 v fl* log log* flog log * v-, log q * log log *• log 1/ 

,< v2 *eN ^ lo g f lo § f ,< y2 9 log* 

q a <t x l 4 

by ((llj) and its weaker version ((12)) , the first term on the right-hand side of equation 
is bounded by the estimate asserted in the statement of the lemma. 

It remains to satisfactorily bound the second term on the right-hand side of equation 
d2*8l . Again dividing the sum so that Lemma ITU can be applied, we have 

£ log ^1 log ^ 2 £ £ n(t;r,l) < £ log q\ log q 2 £ -^4j 

q,,q 2 <y 2 a^^eNreV ai « 2 £?1,£?2<V 2 fli,fl 2 eN ^1^2 

/ fl (l dl «?^ ,1 /4 

v-, x-, flog log* 

+ E i°g«iog, 2 e ^rfi^- 



<7i,<?2<y 2 fli,fl 2 eN 
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%?<tV4 



In the first of these two terms, at least one of the eft must exceed t 1 ' 8 , and so using the 
estimates dl5t , l(TT)l , and ((12)) we see that 

£ logqilog^ £ -«rll ^ 2£log£ £ log^i £ — £ log ^2 £ -^ 

<ji,<72<2/ 2 fli,fl 2 eN ll ^2 qi < y 2 fljeN 9l q 2 < y 2 a 2 eN ^2 



« f 7 / 8 log t-y 1 logy. 



In the second, we simply ignore the restriction cj^cj^ < f 1 ' 4 and use the estimates (fT5l > 
and ((12)) , obtaining 

V- i i V- ^ lo g lo g^ * log log*/ V- i \- 1 

^ Hog log* / y. log^ N 2 



lo g* V,ga 9 

Hog log f- log 2 y 

log t 

This concludes the proof of the lemma. □ 

The following lemma is similar in spirit to Lemma 1141 but is a bit more complicated to 
state and prove. 

Lemma 17. Let b\ and b 2 be positive integers and t > e e a real number. 
(a) ljb x > t 1 / 8 or b 2 > t 1 / 8 then 

2 



V V /, 1\ ^°g t 

£ £ n{t-nr 2 ,l) « — \- 



ne^i neVb. 



2 



(b) If neither b\ nor b 2 exceeds t 1 / 8 then 

^ log log £ 



r 1 r 2 >< 1/ ' 3 

and 
Remark. Again, the values 1/8 and 1/3 for the exponents are rather arbitrary. 
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Proof. The bound in part (a) follows from the trivial estimate n(t; y\y 2 , 1) <S t/Y\Y 2 , just as 
in the proof of Lemma ITiT a). For the first estimate in part (b), we my assume that Y\ < y 2 
by symmetry We use Brun's method again: 



£ £ 7r(f;rir 2/ l) 

neV h r 2 eV b2 
n<n 

r^r 2 >t 1 / 3 

= #{(m,Yi,Y 2 ) '• Y\ = 1 (mod &i), r 2 = 1 (mod b 2 ), r i < r 2/ ^1^2 > * / 
rar^ + 1 < £ , and Y\, Yi, and rar 1/2 + 1 are all prime} 

< I I Ei 

m<t 2 / 3 n<JTJm r 2 <t/mri 

neV r 2 eV h 

mr\r 2 +l prime 

mY\b2 t/mr\ 



neV h 



Notice that tjmr x b 2 > (y/i/m)/b 2 > t l l 6 /t l l s = t 1 / 24 , and so 



£ £ 7r(f;r 1 r 2/ l)< — 2- £ £ .,,.-, 



ne^ ^ ' "' ~ ^g 2 t m ^ 2/3n< ^cp(b 2 n(m) ( p(Y 1 ) 

nev h 



n<r 2 

r 1 r 2 >t 1 / 3 



b 2 t loglogt y 

JL/t \J./L \9 1 2. i—l 



<C 



<P{b x )4>{b 2 y log 1 1 m ^ 2/i 4>(m) 

&2^1oglog^ 
*(&l)(^(&2) 2 logf 



by the estimates and d23b as desired. The second estimate of part (b) is a consequence 
of the first estimate and 

V V nr(t- VV 1\ ^ ^(loglogQ 2 

> > 7t[t)Y\Y 2 ,\) <C -, 

^ r2 t^, 2 <K&iM& 2 )logf 

which follows from the Brun-Titchmarsh inequality just as in the proof of Lemma iMll j). 

□ 
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Proof of Proposition 1731 We may rewrite 



E%r = E E I I iog<? 

cf\r—\ 



2 



E log^iog^ E E E * 

<7iW2<y 2 ai,a 2 eHrieV a x p<t 

q i p=l(modri) 
r 2 €P« 2p = 1(modr2) 

E logfiiog^ E E E * 

'?l/'?2<y 2 fli / fl 2 eNr 1 e'P a a p<f 

q i psl^odrj) 
r 2 eP« 2p = 1(modr2) 

+ o( t '/'lo gt ./lo gy+ »^), 



the last step due to Lemma [T6| Since Y\ and r2 are distinct primes, the innermost sum is 
simply n (t; r\Ti, 1), and thus 

E^(P) 2 ^ E log ^i log ^ 2 E E nfrWiA) 

p<t qi,qi<y 2 ai,a 2 enrxeP a t 

r 2 eV a 2 

1 2 

+ o(^io gt ./io gy+ "°g";^ lo g 2 y ). (29) 

The contribution to the sum on the right-hand side of equation d29t from those terms 
for which cff > t 1 / 8 is 

E log <7i log Qi E E nfrri^A) 

qi,q2<y 2 a lr a 2 eN rieV a^ 

ft >f r 2 eP « 2 

«2 



v-> i i v ^l°g * 

« E iog?iiog? 2 E -«r 

fli,fl 2 <y 2 fli,a 2 6N 'fi Hi 

« flog 2 ! I L^II^ 

« «-* E^ E,** 

« i 7 / 8 log 2 f-y 2 logy 
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by Lemma HZta) and the estimates dl5t . (JTTJ), and ((l2)l : the contribution from the terms for 
which qf 1 > t 1 ' 8 is bounded likewise. The remaining contribution is 

£ log q\ log q 2 £ £ n{t- l r l r 2l l) 

qi,q2<V 2 ai,a 2 €N rxeP uj 

«1 "2 <f l/8 ?1 

<7l ^2 ^ ' r 2 €P fl2 
«2 



< £ log ft log <? 2 £ 



f(loglogQ : 

1„«2 



qi,q 2 <y 2 a l ,a 1 en 1l ll 1°S f 



tf*?^ 1 ' 



^ t(l0gl0gQ 2 / y y logq 



<C 



lo g f V^^N I" 

f(loglog0 2 log 2 y 



log* 

by Lemma HZtb) and the estimates (fl5)l and ((12)| . Using both these bounds in equation 
p9t , we conclude that 

EMrt 2 «^io g >- y 2 io 8y+ t(i ° 8io . g ' ) ; iog2y . 

We now evaluate M2(x) using partial summation. We have 

p<x P p<e p P x e e <p<x Je l e e <p<t 

1 x(log log x) 2 logy 



< 1 + 



x logx 

2i_2 



f x dt/ 7/ a, ?, ffloglogO log y^ 

+ /, ? (» 7 ' 8 log f ■ y 2 log y + JLL y 

« 4JgSV ■ny'iogy f^ + Wy f {1 °f}° s ! )2 it. 

log X * ° * 7^ f9/8 ° * y e , f log £ 



>g 

Evaluating these two integrals explicitly, we obtain 

M2 W < ^^ +y 2 logy + log 2 y • (log log x) 3 < y 3 log 2 y 

lOg X 

as claimed. D 

6. Normal number of cycles for the power generator 

If (u, n) = 1, then the sequence u 1 (mod n) for i = 1,2,... is purely periodic. We 
denote the length of the period by ord(u,n), which of course is the multiplicative order 
of u in (Z/nZ) x . Even when (u, n) > 1, the sequence u 1 (mod n) is eventually periodic, 
and we denote the length of the eventual cycle by ord* (u,n). So, letting nt u \ denote the 
largest divisor of n coprime to u, we have ord (u,n) = ord(u,nr u \). For example, let 
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u = 2, n = 24. The sequence u l (mod n) is 2, 4, 8, 16, 8, 16, . . . with cycle length 2, and so 
ord* (2, 24) = ord(2,3) =2. 

When iterating the £x\\ power map modulo n, the length of the eventual cycle starting 
with x = u is given by ord (£, ord (u,n)). We would like to have a criterion for when a 
residue is part of some cycle, that is, for when a residue is eventually sent back to itself 
when iterating xhx' (mod n). 

Lemma 18. A residue u is part of some cycle under iteration of the map x i— > x e (mod n) if and 
only if {£, ord* (u,n)) = land, with d= (u,n),wehave(d,n/d) = 1. 

Proof If (u, n) = d, then high powers of u will be = (mod n/ni d y). Thus, for u to be in 
a cycle it is necessary that n/ni d \ = d, that is, (d,n/d) = 1. Further, it is necessary that 

{£, ord*(w,n)) = 1. Indeed, if a = ord* (u,n), we would need £ l (mod a) to be purely 
periodic, which is equivalent to (£, <x) = 1. This proves the necessity of the condition. 
For the sufficiency, we have just noted that (£ , a) = 1 implies that £ l (mod a) is purely 
periodic. This implies in turn that the sequence u e ' (mod ni u \) is purely periodic. But the 
condition (d, n/d) = 1 implies that ni u \ = n/d, and as each u = (mod d), we have that 
u (mod n) is purely periodic. □ 

For d\n with (d,n/d) = 1, let C d (£,n) denote the number of cycles in the £th power 
map mod n that involve residues u with (u, n) = d. For the lower bound in Theorem |3 
we shall deal only with C\{£, n), that is, cycles involving numbers coprime to n. 

Lemma 19. We have C\(£,n) > (p(n)^/A(A(n)). 

Proof. It is easy to see that the subgroup of (Z/nZ) x of residues u with (£, ord(u, n)) = 
1 has size (p{n)^y (In fact, this is true for any finite abelian group G: the size of the 
subgroup of elements with order coprime to £ is \G\ rgy) As the length of any cycle in the 
£th power map is bounded above by A(A(n) ), the lemma follows immediately. □ 

To investigate the normal size of cp(n)fgy we introduce the function 

fl(n) = 5> P (<K«))logp. 
p\£ 

We also make use of the notation a a || n, which means that cf is the exact power of q divid- 
ing n, that is, cf divides n but q a+1 does not. 

Proposition 20. For any fixed £, we have fi{n) < (log log n) 2 for almost all n, in fact for all 
but Oi(x/ log log x) integers n < x. 



Proof. We have 



E//(n) = IE S> P (W))logp < *Elogp £ 



v p {A>(f)) 



n<x p\en<xq"\\ n p\£ (f<x 1 

p\t p"<x r p \£ q< x 1 
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Now 

X 



E lo §P E —jr <* x 

and, by ©, 



p\e p a <x V 






Hence, 



E. v-, log log X 

lo SP L „ fl <^ xloglog. 
pl<? «>1 p 

£/ £ (n) <Q x log log x, 



so that the number of n < x with /g(n) > (log log n) 2 is Oi{x/ log log x). D 

It is interesting that one can prove an Erdos-Kac theorem for fg(n) using as a tool the 
criterion of Kubilius-Shapiro (see Hill , |Q6|). 

Proof of the lower bound in Theorem^} Noting that <p(n)/g\ = 4>{n)/ef^ n \ we have (^(n)^ > 

$(n)/exp((loglogn) 2 ) for almost all n by Proposition l20l Of course, n > (p(n) ^$> 
n/ log log n for all n > 3. Therefore, using Lemma [19] and Theorem we have 

C(f.n) > Cl (l.«) > $&& > * W 



A(A(n)) ~ exp((loglogn) 2 )A(A(n)) 
<p(n)/n n 

exp((loglogn) 2 ) A(A(n)) 
= exp((l + o(l))(loglogn) 2 logloglogn) 
for almost all n. This completes the proof of the lower bound in Theorem |2J □ 

We now consider the upper bounds in Theorem|2j first establishing a lemma. 

Lemma 21. Suppose m is a positive integer and (d,m) = 1. For any integer j \ A(m), the 
number of integers u G [1, m] with (u,m) = 1 and ord(du,m) \ A(m)/j is at most 4>(m)/j. 

Proof. In fact, we prove a more general statement for any finite abelian group G: let A(G) 
denote the exponent of G, that is, the order of the largest cyclic subgroup of G, or equiv- 
alently the least common multiple of the orders of the elements of G. Then for any d G G 
and any j \ A(G), the number of elements u G G for which the order of du divides A(G)/j 
is at most #G/j. It is clear that the lemma follows immediately from this statement upon 
taking G to be (Z/raZ) x . It is also clear that in this statement, the element d plays no role 
whatsoever except to shuffle the elements of G around, and so we assume without loss of 
generality that d is the identity of G. 

Let p be any prime dividing A(G), and choose a < &sothatp fl ||/ andp 6 ||A(G). When we 
write G canonically as isomorphic to the direct product of cyclic groups of prime-power 
order, at least one of the factors must be isomorphic to Z/p b Z. In every such factor, only 
one out of every p a elements has order dividing A(G)/j, since all but p h ~ a elements of the 
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factor have order divisible by p b ~ a + 1 . Since there is at least one such factor for every p a \\ j, 
we conclude that at most one out of every / elements of G has order dividing A(G)/j, as 
claimed. □ 

Note that this result in the case d = 1 is Lemma 1 in |9[. The above proof, while similar 
in spirit to the proof in |9J, is simpler. 

Let t(w) denote the number of positive divisors of m. 

Proposition 22. For any integers £,n > Iwehave C(£,n) < nr(A(n))r(n) / ord* (£, A(n)). 

Proof. It is sufficient to show that for each £, n > 2 and each d \ n with (d, n/d) = 1, we 
have 

Let d | n with (d, n/d) = 1. We have seen in Lemma li"8l that for a residue u (mod n) with 
(u, n) = d to be involved in a cycle, it is necessary and sufficient that (£, ord(u, n/d)) = 1. 
For each integer / | A(n/d), let Ca r j(£,n) denote the number of cycles corresponding to 
residues u with (u, n) = d and ord(w, n/d) = A(n/d)/ j. Writing such a residue u as du\, 
we have u\ G [I, n/d] and {u\,n/d) = 1. Thus, by Lemma Izfl we have that the number 
of such residues u is at most 4>(n/d)/ j < n/d]. Hence we have 

d ' ][ ' } ~ ord(£, Mn/d)/j)' 

Now A(n/d) = A{n)/d\ for some integer d\ < d. It is shown in (15) of IIT2ll that for k \ m 
we have ord [a, m/k) > ord (a, m)/k for any nonzero integer a. Hence 

ord(£, A(n/d)/j) = ord(£,A(n)/d 1 i) > ord*(£,A(n))/d 1 j, 

so that 

n/dj n 

Cd >j( £ ' n ) ~ OT d*(^A(n))/di7 ~ ord*(^A(n))" 
Letting j range over all divisors of A(n/d), we get that 

nr(A(n/d)) 
^ A *" U) ~ ord*(£,A(n))' 
which immediately gives ((30)l . D 

Proof of the upper bounds in Theorem\2\ Note that from |6, Theorem 4.1], we have r(A(n)) < 
exp((log log n) 2 ) for almost all n. Furthermore, letting Cl(n) denote the number of prime 
factors of n counted with multiplicity, we know that the normal order of D.(n) is log log n; 
in particular, we have Cl(n) < log log n/ log 2 for almost all n. Since the inequality r(n) < 
2 n (") is elementary, this implies that r(n) < logn for almost all n. We conclude from 
Proposition |22] that 

C{£,n) < nexp(2(loglogn) 2 )/ord*(£,A(n)) 

for almost all n. 

The three upper bounds in Theorem |2] therefore follow respectively from three results 
in the new paper of Kurlberg and the second author |12J: Theorem 4 (1), which states that 
for any function e(n) — ► 0, we have ord* (£, A(n)) > n 1 / 2+£ ( n "> almost always; Theorem 
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22, which states that a positive proportion of integers n have ord* (£, A(n)) > n 592 ; and 
Theorem 28, which states that if the GRH is true, then 

ord*(£, A(n)) = n/exp((l + o(l))(loglogn) 2 logloglogn) 

on a set of asymptotic density 1. (Note that the proof of this result uses Theorem [l] of the 
current paper.) □ 

7. Higher iterates 

Here we sketch what we believe to be a viable strategy for establishing an analogue 
of Theorem [l] for the higher iterates \ where k > 3. As in the case of k = 2, we have 
generally that 

n n 4>k( n ) 

A k (n) (p k (n) A k (n) ' 

We always have n/(p k (n) < (clog log n) k , which is already a good enough estimate 
for our purposes. Even better, however, it is known |5| that for each fixed k, we have 
n/(p k (n) <C (log log log n) k for almost all n. The problem therefore reduces to comparing 
A k (n) to 4> k {n). Probably it is not hard to get analogs of Propositions |5] and |6j where we 
replace y 2 with y k . The problem comes in with the proliferation of cases needed to deal 
with small prime factors. As with the second iterate, we expect the main contribution to 
come from the "supersquarefree" case. In particular, let 

h(n) = ££••• £ £ v q (p k -l)logq. 

P\\npi\p\-\ p k \p k ^-l q<y k 

We expect h k (n) to be the dominant contribution to log((fo(n)//\/c(ft)) almost always. But 
it seems hard not only to prove this in general but also to establish the normal order of 
h{n). 

It would seem useful in this endeavor to have a uniform estimate of the shape 

El log log x — log log m . ^ -ii, ,-„ s 

- - —^ — ^ — -—^ — ^— for x > m 1+£ . (31) 

peV m ,p<xP <K m ) 



Even under the assumption of the Riemann Hypothesis for Dirichlet L-functions, (1311 1 
seems difficult, and maybe it is false. It implies with x = m 1 that the sum is ^C l/<p(m), 
when all we seem to be able to prove, via sieve methods, is that it is <^ (log log m)/<p(m). 
Assuming uniformity in d3Tt , it seems that on average 

■y 

hk ^ ~ (fcrw^ loglo8W ^ logloglo8W ' 

supporting Conjecture It would be a worthwhile enterprise to try to verify or disprove 
the Conjecture in the case k = 3, which may be tractable. 

Going out even further on a limb, it may be instructive to think of what Conjecture 
[3] has to say about the normal order of L(n), the minimum value of k with A k (n) = 1. 
The expression (l/(k — l)!)(loglogn) fc logloglogn reaches its maximum value when 
k ~ log log n. Is this formula then trying to tell us that we have L(n) <^ log log n almost 
always? Perhaps so. 
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There is a second argument supporting the thought that L(n) <C log log n almost al- 
ways. Let P(n) denote the largest prime factor of an integer n > 1, and let £{n) = P(n) — 1 
for n > 1, £{1) = 1. Clearly, £{n) \ A(n) for all n, so that if Lg(n) is the least k with 
£jc(n) = 1, then Lq{u) < L(n). It may be that the difference L(n) — Lq{ti) is usually not 
large. In any event, it seems safe to conjecture that Lq(h) is usually of order of magni- 
tude log log n, due to the following argument. For an odd prime p, consider the quantity 
log £(p)/ log p » log P(p — 1)/ log(p — 1). It may be that this quantity is distributed as p 
varies through the primes in the same way that log P(n)/ log n is distributed as n varies 
through the integers, namely the Dickman distribution. Such a conjecture has been made 
in various papers. If so, it may be that the sequence 

loglfr) logl 2 (p) 
logp ' log£(p) 

behaves like a sequence of independent random variables, each with the Dickman distri- 
bution. And if so, it may then be reasonable to assume that almost always we get down 
to small numbers and terminate in about log log n steps. A similar probabilistic model 
is considered in [1], but for the simpler experiment of finding the joint distribution of 
logarithmic sizes of the various prime factors of a given number n. 
At the very least, we can prove that L(n) <C log log n infinitely often. 

Proof of Theorem^ Notice that the definition of A(n) as a least common multiple, together 
with the fact that A(p fl ) | A(p a+1 ) always, implies that 

/\(lcm{rai,. . .,itij}) = lcm {A {m\ ),..., A (my) } 

for any positive integers m\, . . . , my. A trivial induction then shows that 

A fc (lcm{m 1 ,...,my}) = lcm {A k (rtii ),..., Afc(wy)} 

for any k > 0. Since the least common multiple of a set of numbers equals 1 precisely 
when each number in the set equals 1, we deduce that 

L(lcm{mi,. . .,my}) = max |L(mi),. . .,L(my)}. 

We apply this identity with ra ; = i. Let ny = lcm{l,2, . . .,/}. We have log rij = 
Y^i<j A(z), which is asymptotic to j by the prime number theorem. On the other hand, it is 
trivial that for any numbern we have L(n) < 1 + (l/log2) logn, as A !+ i(n) < (l/2)A z (n) 
for 1 < i < L(n). Therefore 

log 1 log j 



L(rij) = max{L(l),...,L(y)} < 1 + max i 2 ^^ 

log 2 (l^2 +0(1) ) 1Ogl ° g ^ 



log 7 

1+ hl 



□ 



We can improve on the estimate in Theorem HJ but not by much. Say we let Nj be the 
product of all primes p < y 329 with p — 1 | tij, with n, as in the above proof. It follows 
from Friedlander HSJ that a positive proportion of the primes p < y' 329 have the required 
property. Thus, Nj > exp(cy' 329 ) for some positive constant c and all sufficiently large 
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values of ;'. But A(Ny) | n ;/ so that L(N ; ) < 2 + //log 2. Hence L(N ; ) < .439 log log Nj 
for j sufficiently large. (This result can be improved by a very small margin using a more 
recent result of Baker and Harman |2], but the argument is a bit more difficult, since they 
do not get a positive proportion of the primes with the required property.) It is likely that 
L(n) <C log log log n infinitely often, possibly even that L(n) <Cjt log fc ft infinitely often 
for arbitrary fc-fold-iterated logarithms. 

One may also study the maximal order of L(n). The analogous problem for the iter- 
ated (^-function is relatively trivial, but not so for A. If there can exist very long "Sophie 
Germain chains", that is, sequences of primes p\, pi> ■ ■ ■ >Pk where each p, = 2p,_i + 1, 
for i > 1, then we might have L(p/t) ~ (1/ log 2) log p^. We might even perturb such 
a chain by a small amount and keep the asymptotic relation, say by occasionally having 
Pi = 4p,_ 1 + 1. It seems hard to prove that long enough chains to get the the asymptotic 
for L(pfc) do not exist, but probably they don't on probabilistic grounds. We can at least 
say that L(n) > 1 + (1/ log 3) log n infinitely often, since this inequality is attained when 
n is a power of 3. 
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